Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 786600 |
| Missing cells | 24767 |
| Missing cells (%) | 0.2% |
| Duplicate rows | 546 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 78.0 MiB |
| Average record size in memory | 104.0 B |
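The percentage and per-record figures in this table follow from the raw counts. The sketch below recomputes them as a sanity check; it assumes that missing cells are measured against all cells (rows × variables) and that 1 MiB is 1024 × 1024 bytes. The variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 13
n_observations = 786_600
missing_cells = 24_767
duplicate_rows = 546
total_memory_mib = 78.0

# Assumption: missing cells are expressed as a share of all cells.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_memory_mib * 1024 * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Rounded to one decimal place, the three values agree with the table.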
Variable types
| Categorical | 4 |
|---|---|
| Numeric | 9 |
Warnings
| Warning | Type |
|---|---|
| Dataset has 546 (0.1%) duplicate rows | Duplicates |
| customer_id has a high cardinality: 245455 distinct values | High cardinality |
| order_date has a high cardinality: 776 distinct values | High cardinality |
| customer_order_rank has 24767 (3.1%) missing values | Missing |
| voucher_amount is highly skewed (γ1 = 30.39394065) | Skewed |
| platform_id is highly skewed (γ1 = -22.53663783) | Skewed |
| voucher_amount has 743462 (94.5%) zeros | Zeros |
| delivery_fee has 597536 (76.0%) zeros | Zeros |
Reproduction
| Analysis started | 2021-02-25 23:03:01.794157 |
|---|---|
| Analysis finished | 2021-02-25 23:03:58.065411 |
| Duration | 56.27 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
customer_id
Categorical
| Distinct | 245455 |
|---|---|
| Distinct (%) | 31.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| 15edce943edd | 386 |
|---|---|
| 8745a335e9cf | 288 |
| d956116d863d | 286 |
| 0063666607bb | 273 |
| ae60dce05485 | 270 |
| Other values (245450) |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Characters and Unicode
| Total characters | 9439200 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
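The category counts reported for these 12-character hexadecimal identifiers can be illustrated with Python's standard `unicodedata` module. This is a sketch only: it runs on the five sample values this report lists for the variable rather than on the full column, and the helper `category_counts` is a hypothetical name. It does not claim to reproduce how the profiler itself computes these numbers.

```python
import unicodedata
from collections import Counter

# The five sample values this report lists for the variable.
sample_ids = [
    "000097eabfd9",
    "0000e2c6d9be",
    "000133bb597f",
    "00018269939b",
    "0001a00468a6",
]

def category_counts(values):
    """Count characters per Unicode general category (e.g. 'Nd', 'Ll')."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

counts = category_counts(sample_ids)
total_characters = sum(counts.values())

# 'Nd' is "Decimal Number" and 'Ll' is "Lowercase Letter".
print(total_characters, dict(counts))
```

Only two categories appear, `Nd` and `Ll`, which matches the "Distinct categories" figure of 2.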
Unique
| Unique | 145498 |
|---|---|
| Unique (%) | 18.5% |
Sample
| 1st row | 000097eabfd9 |
|---|---|
| 2nd row | 0000e2c6d9be |
| 3rd row | 000133bb597f |
| 4th row | 00018269939b |
| 5th row | 0001a00468a6 |
| Value | Count | Frequency (%) |
| 15edce943edd | 386 | < 0.1% |
| 8745a335e9cf | 288 | < 0.1% |
| d956116d863d | 286 | < 0.1% |
| 0063666607bb | 273 | < 0.1% |
| ae60dce05485 | 270 | < 0.1% |
| a54a8e1579d4 | 254 | < 0.1% |
| bebb751d49b8 | 253 | < 0.1% |
| 26ed6389a3aa | 245 | < 0.1% |
| ef6265f74aca | 229 | < 0.1% |
| a333fb175a0c | 221 | < 0.1% |
| Other values (245445) | 783895 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 594904 | 6.3% |
| 4 | 594873 | 6.3% |
| 0 | 594664 | 6.3% |
| 8 | 591857 | 6.3% |
| b | 591838 | 6.3% |
| d | 590973 | 6.3% |
| 5 | 590840 | 6.3% |
| e | 589501 | 6.2% |
| 2 | 589442 | 6.2% |
| 3 | 589119 | 6.2% |
| Other values (6) | 3521189 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5907912 | |
| Lowercase Letter | 3531288 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 6 | 594904 | |
| 4 | 594873 | |
| 0 | 594664 | |
| 8 | 591857 | |
| 5 | 590840 | |
| 2 | 589442 | |
| 3 | 589119 | |
| 7 | 587458 | |
| 9 | 587405 | |
| 1 | 587350 |
| Value | Count | Frequency (%) |
| b | 591838 | |
| d | 590973 | |
| e | 589501 | |
| f | 589006 | |
| a | 586882 | |
| c | 583088 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5907912 | |
| Latin | 3531288 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 6 | 594904 | |
| 4 | 594873 | |
| 0 | 594664 | |
| 8 | 591857 | |
| 5 | 590840 | |
| 2 | 589442 | |
| 3 | 589119 | |
| 7 | 587458 | |
| 9 | 587405 | |
| 1 | 587350 |
| Value | Count | Frequency (%) |
| b | 591838 | |
| d | 590973 | |
| e | 589501 | |
| f | 589006 | |
| a | 586882 | |
| c | 583088 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9439200 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 6 | 594904 | 6.3% |
| 4 | 594873 | 6.3% |
| 0 | 594664 | 6.3% |
| 8 | 591857 | 6.3% |
| b | 591838 | 6.3% |
| d | 590973 | 6.3% |
| 5 | 590840 | 6.3% |
| e | 589501 | 6.2% |
| 2 | 589442 | 6.2% |
| 3 | 589119 | 6.2% |
| Other values (6) | 3521189 |
order_date
Categorical
| Distinct | 776 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| 2017-01-01 | 4230 |
|---|---|
| 2016-12-18 | 3395 |
| 2017-02-26 | 3234 |
| 2017-02-05 | 3218 |
| 2017-02-12 | 3125 |
| Other values (771) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 7866000 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 41 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2015-06-20 |
|---|---|
| 2nd row | 2016-01-29 |
| 3rd row | 2017-02-26 |
| 4th row | 2017-02-05 |
| 5th row | 2015-08-04 |
| Value | Count | Frequency (%) |
| 2017-01-01 | 4230 | 0.5% |
| 2016-12-18 | 3395 | 0.4% |
| 2017-02-26 | 3234 | 0.4% |
| 2017-02-05 | 3218 | 0.4% |
| 2017-02-12 | 3125 | 0.4% |
| 2016-12-11 | 3100 | 0.4% |
| 2016-12-04 | 3075 | 0.4% |
| 2017-01-22 | 3005 | 0.4% |
| 2017-01-29 | 3003 | 0.4% |
| 2016-10-03 | 2999 | 0.4% |
| Other values (766) | 754216 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1727194 | |
| - | 1573200 | |
| 1 | 1537463 | |
| 2 | 1281258 | |
| 6 | 599766 | 7.6% |
| 5 | 343454 | 4.4% |
| 7 | 243724 | 3.1% |
| 3 | 166297 | 2.1% |
| 8 | 135891 | 1.7% |
| 9 | 135707 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6292800 | |
| Dash Punctuation | 1573200 | 20.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 1727194 | |
| 1 | 1537463 | |
| 2 | 1281258 | |
| 6 | 599766 | 9.5% |
| 5 | 343454 | 5.5% |
| 7 | 243724 | 3.9% |
| 3 | 166297 | 2.6% |
| 8 | 135891 | 2.2% |
| 9 | 135707 | 2.2% |
| 4 | 122046 | 1.9% |
| Value | Count | Frequency (%) |
| - | 1573200 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7866000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 1727194 | |
| - | 1573200 | |
| 1 | 1537463 | |
| 2 | 1281258 | |
| 6 | 599766 | 7.6% |
| 5 | 343454 | 4.4% |
| 7 | 243724 | 3.1% |
| 3 | 166297 | 2.1% |
| 8 | 135891 | 1.7% |
| 9 | 135707 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7866000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 1727194 | |
| - | 1573200 | |
| 1 | 1537463 | |
| 2 | 1281258 | |
| 6 | 599766 | 7.6% |
| 5 | 343454 | 4.4% |
| 7 | 243724 | 3.1% |
| 3 | 166297 | 2.1% |
| 8 | 135891 | 1.7% |
| 9 | 135707 | 1.7% |
order_hour
Real number (ℝ≥0)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.58879608 |
|---|---|
| Minimum | 0 |
| Maximum | 23 |
| Zeros | 4627 |
| Zeros (%) | 0.6% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 16 |
| median | 18 |
| Q3 | 20 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 4 |
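As a rough illustration of how the quartiles, IQR and range in a table like this relate to each other, the sketch below uses a small made-up sample, not the real order_hour column. It assumes linear interpolation between observations, which is what `method="inclusive"` does in Python's `statistics.quantiles`; the profiler's exact quantile method is not stated in this report.

```python
import statistics

# Hypothetical toy sample of order hours; NOT taken from the dataset.
hours = [12, 16, 17, 18, 18, 19, 20, 20, 22]

# n=4 yields the three quartile cut points (Q1, median, Q3).
q1, median, q3 = statistics.quantiles(hours, n=4, method="inclusive")

iqr = q3 - q1                          # interquartile range
value_range = max(hours) - min(hours)  # maximum minus minimum

print(q1, median, q3, iqr, value_range)
```

In the report itself the same relationships hold: IQR = Q3 − Q1 = 20 − 16 = 4 and Range = 23 − 0 = 23.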
Descriptive statistics
| Standard deviation | 3.357192477 |
|---|---|
| Coefficient of variation (CV) | 0.1908710785 |
| Kurtosis | 5.749711941 |
| Mean | 17.58879608 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -1.749088644 |
| Sum | 13835347 |
| Variance | 11.27074133 |
| Monotonicity | Not monotonic |
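Several of these descriptive statistics are simple functions of one another. The sketch below recomputes the coefficient of variation and the variance from the reported mean and standard deviation, and shows one common definition of the median absolute deviation on a made-up sample. The helper `mad` and the toy data are hypothetical and only illustrate the definitions.

```python
import statistics

# Values copied from the order_hour "Descriptive statistics" table.
mean = 17.58879608
std = 3.357192477

cv = std / mean        # coefficient of variation = std / mean
variance = std ** 2    # variance = std squared

def mad(values):
    """Median absolute deviation: median of |x - median(x)|."""
    values = list(values)
    med = statistics.median(values)
    return statistics.median([abs(x - med) for x in values])

# Hypothetical toy sample; NOT taken from the dataset.
toy_hours = [16, 17, 18, 18, 19, 20, 23]

print(f"CV = {cv:.10f}")
print(f"Variance = {variance:.8f}")
print(f"MAD of toy sample = {mad(toy_hours)}")
```

Up to rounding of the inputs, `cv` and `variance` match the 0.1908710785 and 11.27074133 shown in the table.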
| Value | Count | Frequency (%) |
| 19 | 134030 | |
| 18 | 129654 | |
| 20 | 108739 | |
| 17 | 90782 | |
| 21 | 68223 | |
| 16 | 48877 | 6.2% |
| 15 | 34286 | 4.4% |
| 22 | 33403 | 4.2% |
| 13 | 31105 | 4.0% |
| 14 | 30323 | 3.9% |
| Other values (14) | 77178 |
| Value | Count | Frequency (%) |
| 0 | 4627 | |
| 1 | 2425 | |
| 2 | 1187 | 0.2% |
| 3 | 443 | 0.1% |
| 4 | 137 | < 0.1% |
| Value | Count | Frequency (%) |
| 23 | 13832 | 1.8% |
| 22 | 33403 | 4.2% |
| 21 | 68223 | |
| 20 | 108739 | |
| 19 | 134030 |
customer_order_rank
Real number (ℝ≥0)
| Distinct | 369 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 24767 |
| Missing (%) | 3.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.436809642 |
|---|---|
| Minimum | 1 |
| Maximum | 369 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 10 |
| 95-th percentile | 39 |
| Maximum | 369 |
| Range | 368 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 17.77232218 |
|---|---|
| Coefficient of variation (CV) | 1.88329773 |
| Kurtosis | 49.04720204 |
| Mean | 9.436809642 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 5.494014541 |
| Sum | 7189273 |
| Variance | 315.8554356 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 244937 | |
| 2 | 96641 | 12.3% |
| 3 | 60532 | 7.7% |
| 4 | 43681 | 5.6% |
| 5 | 34036 | 4.3% |
| 6 | 27603 | 3.5% |
| 7 | 23049 | 2.9% |
| 8 | 19696 | 2.5% |
| 9 | 17013 | 2.2% |
| 10 | 14889 | 1.9% |
| Other values (359) | 179756 | |
| (Missing) | 24767 | 3.1% |
| Value | Count | Frequency (%) |
| 1 | 244937 | |
| 2 | 96641 | 12.3% |
| 3 | 60532 | 7.7% |
| 4 | 43681 | 5.6% |
| 5 | 34036 | 4.3% |
| Value | Count | Frequency (%) |
| 369 | 1 | |
| 368 | 1 | |
| 367 | 1 | |
| 366 | 1 | |
| 365 | 1 |
is_failed
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| 0 | 761833 |
|---|---|
| 1 | 24767 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 786600 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 761833 | 96.9% |
| 1 | 24767 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 761833 | |
| 1 | 24767 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 786600 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 761833 | |
| 1 | 24767 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 786600 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 761833 | |
| 1 | 24767 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 786600 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 761833 | |
| 1 | 24767 | 3.1% |
voucher_amount
Real number (ℝ≥0)
| Distinct | 911 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.09148909292 |
|---|---|
| Minimum | 0 |
| Maximum | 93.3989 |
| Zeros | 743462 |
| Zeros (%) | 94.5% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.686 |
| Maximum | 93.3989 |
| Range | 93.3989 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4795579176 |
|---|---|
| Coefficient of variation (CV) | 5.241694963 |
| Kurtosis | 3886.352852 |
| Mean | 0.09148909292 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 30.39394065 |
| Sum | 71965.32049 |
| Variance | 0.2299757963 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 743462 | |
| 1.029 | 11647 | 1.5% |
| 1.715 | 11134 | 1.4% |
| 2.058 | 9122 | 1.2% |
| 0.686 | 3648 | 0.5% |
| 1.372 | 1770 | 0.2% |
| 2.744 | 1192 | 0.2% |
| 2.5725 | 897 | 0.1% |
| 3.43 | 543 | 0.1% |
| 0.5145 | 373 | < 0.1% |
| Other values (901) | 2812 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 743462 | |
| 0.00343 | 35 | < 0.1% |
| 0.28469 | 1 | < 0.1% |
| 0.32242 | 1 | < 0.1% |
| 0.343 | 19 | < 0.1% |
| Value | Count | Frequency (%) |
| 93.3989 | 1 | |
| 78.02907 | 1 | |
| 68.3942 | 1 | |
| 61.82575 | 1 | |
| 37.57565 | 1 |
delivery_fee
Real number (ℝ≥0)
| Distinct | 98 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1811799318 |
|---|---|
| Minimum | 0 |
| Maximum | 9.86 |
| Zeros | 597536 |
| Zeros (%) | 76.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.986 |
| Maximum | 9.86 |
| Range | 9.86 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3697095668 |
|---|---|
| Coefficient of variation (CV) | 2.040565769 |
| Kurtosis | 8.481347092 |
| Mean | 0.1811799318 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.417459196 |
| Sum | 142516.1343 |
| Variance | 0.1366851638 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 597536 | |
| 0.493 | 70617 | 9.0% |
| 0.986 | 35735 | 4.5% |
| 0.7395 | 34790 | 4.4% |
| 0.2465 | 7664 | 1.0% |
| 1.2325 | 7164 | 0.9% |
| 1.479 | 6768 | 0.9% |
| 1.4297 | 5078 | 0.6% |
| 0.46835 | 3097 | 0.4% |
| 0.4437 | 2657 | 0.3% |
| Other values (88) | 15494 | 2.0% |
| Value | Count | Frequency (%) |
| 0 | 597536 | |
| 0.02465 | 10 | < 0.1% |
| 0.0493 | 3 | < 0.1% |
| 0.0986 | 4 | < 0.1% |
| 0.1479 | 303 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.86 | 1 | |
| 7.395 | 1 | |
| 6.6555 | 1 | |
| 6.409 | 1 | |
| 5.916 | 1 |
amount_paid
Real number (ℝ≥0)
| Distinct | 6471 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.18327131 |
|---|---|
| Minimum | 0 |
| Maximum | 1131.03 |
| Zeros | 872 |
| Zeros (%) | 0.1% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4.5135 |
| Q1 | 6.64812 |
| median | 9.027 |
| Q3 | 12.213 |
| 95-th percentile | 19.5408 |
| Maximum | 1131.03 |
| Range | 1131.03 |
| Interquartile range (IQR) | 5.56488 |
Descriptive statistics
| Standard deviation | 5.6181212 |
|---|---|
| Coefficient of variation (CV) | 0.5517010233 |
| Kurtosis | 2243.912588 |
| Mean | 10.18327131 |
| Median Absolute Deviation (MAD) | 2.655 |
| Skewness | 15.5881411 |
| Sum | 8010161.21 |
| Variance | 31.56328582 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.31 | 14667 | 1.9% |
| 7.965 | 14410 | 1.8% |
| 6.372 | 11878 | 1.5% |
| 8.496 | 10350 | 1.3% |
| 6.903 | 9988 | 1.3% |
| 5.841 | 9734 | 1.2% |
| 9.027 | 9213 | 1.2% |
| 7.434 | 9156 | 1.2% |
| 10.62 | 8982 | 1.1% |
| 9.558 | 8377 | 1.1% |
| Other values (6461) | 679845 |
| Value | Count | Frequency (%) |
| 0 | 872 | |
| 0.00531 | 1 | < 0.1% |
| 0.01593 | 1 | < 0.1% |
| 0.02655 | 1 | < 0.1% |
| 0.03717 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1131.03 | 1 | |
| 581.7105 | 1 | |
| 363.01815 | 1 | |
| 353.3805 | 1 | |
| 246.88845 | 1 |
restaurant_id
Real number (ℝ≥0)
| Distinct | 13569 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 162864079.3 |
|---|---|
| Minimum | 73498 |
| Maximum | 340453498 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 73498 |
|---|---|
| 5-th percentile | 29803498 |
| Q1 | 86023498 |
| median | 169613498 |
| Q3 | 228433498 |
| 95-th percentile | 302393498 |
| Maximum | 340453498 |
| Range | 340380000 |
| Interquartile range (IQR) | 142410000 |
Descriptive statistics
| Standard deviation | 87830821.23 |
|---|---|
| Coefficient of variation (CV) | 0.5392890906 |
| Kurtosis | -1.08595334 |
| Mean | 162864079.3 |
| Median Absolute Deviation (MAD) | 71240000 |
| Skewness | -0.02254910338 |
| Sum | 1.281088848 × 10¹⁴ |
| Variance | 7.714253157 × 10¹⁵ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37623498 | 1317 | 0.2% |
| 983498 | 1071 | 0.1% |
| 192673498 | 1031 | 0.1% |
| 154543498 | 999 | 0.1% |
| 88773498 | 967 | 0.1% |
| 146723498 | 942 | 0.1% |
| 105253498 | 935 | 0.1% |
| 18603498 | 922 | 0.1% |
| 30633498 | 918 | 0.1% |
| 29593498 | 882 | 0.1% |
| Other values (13559) | 776616 |
| Value | Count | Frequency (%) |
| 73498 | 120 | |
| 123498 | 37 | < 0.1% |
| 153498 | 193 | |
| 173498 | 181 | |
| 193498 | 84 |
| Value | Count | Frequency (%) |
| 340453498 | 1 | |
| 340093498 | 2 | |
| 340033498 | 1 | |
| 339983498 | 2 | |
| 339913498 | 1 |
city_id
Real number (ℝ≥0)
| Distinct | 3749 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47179.7505 |
|---|---|
| Minimum | 230 |
| Maximum | 100205 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 230 |
|---|---|
| 5-th percentile | 10346 |
| Q1 | 24799 |
| median | 46467 |
| Q3 | 67886 |
| 95-th percentile | 89749 |
| Maximum | 100205 |
| Range | 99975 |
| Interquartile range (IQR) | 43087 |
Descriptive statistics
| Standard deviation | 25904.63056 |
|---|---|
| Coefficient of variation (CV) | 0.5490624747 |
| Kurtosis | -1.018564164 |
| Mean | 47179.7505 |
| Median Absolute Deviation (MAD) | 21419 |
| Skewness | 0.05185593619 |
| Sum | 3.711159174 × 10¹⁰ |
| Variance | 671049884.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10346 | 86654 | 11.0% |
| 20326 | 36210 | 4.6% |
| 80562 | 34100 | 4.3% |
| 50898 | 21627 | 2.7% |
| 40441 | 16732 | 2.1% |
| 60537 | 14760 | 1.9% |
| 44366 | 14119 | 1.8% |
| 45358 | 11246 | 1.4% |
| 4334 | 11106 | 1.4% |
| 90633 | 10449 | 1.3% |
| Other values (3739) | 529597 |
| Value | Count | Frequency (%) |
| 230 | 993 | 0.1% |
| 1298 | 6519 | |
| 1676 | 77 | < 0.1% |
| 1685 | 33 | < 0.1% |
| 1689 | 18 | < 0.1% |
| Value | Count | Frequency (%) |
| 100205 | 1 | < 0.1% |
| 100079 | 1 | < 0.1% |
| 100061 | 3 | < 0.1% |
| 100048 | 56 | |
| 99999 | 5 | < 0.1% |
payment_id
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| 1619 | 476600 |
|---|---|
| 1779 | 234133 |
| 1491 | 36497 |
| 1811 | 34492 |
| 1523 | 4878 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3146400 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1779 |
|---|---|
| 2nd row | 1619 |
| 3rd row | 1619 |
| 4th row | 1619 |
| 5th row | 1619 |
| Value | Count | Frequency (%) |
| 1619 | 476600 | 60.6% |
| 1779 | 234133 | 29.8% |
| 1491 | 36497 | 4.6% |
| 1811 | 34492 | 4.4% |
| 1523 | 4878 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1368681 | |
| 9 | 747230 | |
| 6 | 476600 | 15.1% |
| 7 | 468266 | 14.9% |
| 4 | 36497 | 1.2% |
| 8 | 34492 | 1.1% |
| 5 | 4878 | 0.2% |
| 2 | 4878 | 0.2% |
| 3 | 4878 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3146400 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 1368681 | |
| 9 | 747230 | |
| 6 | 476600 | 15.1% |
| 7 | 468266 | 14.9% |
| 4 | 36497 | 1.2% |
| 8 | 34492 | 1.1% |
| 5 | 4878 | 0.2% |
| 2 | 4878 | 0.2% |
| 3 | 4878 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3146400 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 1368681 | |
| 9 | 747230 | |
| 6 | 476600 | 15.1% |
| 7 | 468266 | 14.9% |
| 4 | 36497 | 1.2% |
| 8 | 34492 | 1.1% |
| 5 | 4878 | 0.2% |
| 2 | 4878 | 0.2% |
| 3 | 4878 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3146400 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 1368681 | |
| 9 | 747230 | |
| 6 | 476600 | 15.1% |
| 7 | 468266 | 14.9% |
| 4 | 36497 | 1.2% |
| 8 | 34492 | 1.1% |
| 5 | 4878 | 0.2% |
| 2 | 4878 | 0.2% |
| 3 | 4878 | 0.2% |
platform_id
Real number (ℝ≥0)
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29868.52938 |
|---|---|
| Minimum | 525 |
| Maximum | 30423 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 525 |
|---|---|
| 5-th percentile | 29463 |
| Q1 | 29463 |
| median | 29815 |
| Q3 | 30231 |
| 95-th percentile | 30359 |
| Maximum | 30423 |
| Range | 29898 |
| Interquartile range (IQR) | 768 |
Descriptive statistics
| Standard deviation | 1160.893265 |
|---|---|
| Coefficient of variation (CV) | 0.03886677012 |
| Kurtosis | 565.3036862 |
| Mean | 29868.52938 |
| Median Absolute Deviation (MAD) | 352 |
| Skewness | -22.53663783 |
| Sum | 2.349458521 × 10¹⁰ |
| Variance | 1347673.174 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 29463 | 241523 | |
| 30231 | 216726 | |
| 29815 | 158972 | |
| 30359 | 103653 | |
| 30391 | 24434 | 3.1% |
| 29751 | 19321 | 2.5% |
| 29495 | 11151 | 1.4% |
| 30423 | 6819 | 0.9% |
| 30199 | 2079 | 0.3% |
| 525 | 1094 | 0.1% |
| Other values (4) | 828 | 0.1% |
| Value | Count | Frequency (%) |
| 525 | 1094 | 0.1% |
| 22167 | 3 | < 0.1% |
| 22263 | 232 | < 0.1% |
| 22295 | 1 | < 0.1% |
| 29463 | 241523 |
| Value | Count | Frequency (%) |
| 30423 | 6819 | 0.9% |
| 30391 | 24434 | 3.1% |
| 30359 | 103653 | |
| 30231 | 216726 | |
| 30199 | 2079 | 0.3% |
transmission_id
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4253.246112 |
|---|---|
| Minimum | 212 |
| Maximum | 21124 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 212 |
|---|---|
| 5-th percentile | 4228 |
| Q1 | 4228 |
| median | 4324 |
| Q3 | 4356 |
| 95-th percentile | 4356 |
| Maximum | 21124 |
| Range | 20912 |
| Interquartile range (IQR) | 128 |
Descriptive statistics
| Standard deviation | 572.8556657 |
|---|---|
| Coefficient of variation (CV) | 0.1346866959 |
| Kurtosis | 176.6261099 |
| Mean | 4253.246112 |
| Median Absolute Deviation (MAD) | 32 |
| Skewness | -0.9114324558 |
| Sum | 3345603392 |
| Variance | 328163.6137 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4356 | 341734 | |
| 4324 | 203668 | |
| 4228 | 201617 | |
| 4260 | 14538 | 1.8% |
| 212 | 12676 | 1.6% |
| 4996 | 6737 | 0.9% |
| 4196 | 5276 | 0.7% |
| 1988 | 207 | < 0.1% |
| 21124 | 146 | < 0.1% |
| 2020 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 212 | 12676 | 1.6% |
| 1988 | 207 | < 0.1% |
| 2020 | 1 | < 0.1% |
| 4196 | 5276 | 0.7% |
| 4228 | 201617 |
| Value | Count | Frequency (%) |
| 21124 | 146 | < 0.1% |
| 4996 | 6737 | 0.9% |
| 4356 | 341734 | |
| 4324 | 203668 | |
| 4260 | 14538 | 1.8% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
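The calculation described above can be sketched in a few lines of plain Python. The function name `pearson_r` and the toy data are illustrative assumptions; this is not the code the profiler runs.

```python
import math

def pearson_r(xs, ys):
    """Covariance of X and Y divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n factors of covariance and standard deviations cancel,
    # so plain sums of (cross-)deviations are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical toy data.
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]

r = pearson_r(xs, ys)
# Shifting and rescaling X leaves r unchanged (location/scale invariance).
r_rescaled = pearson_r([3 * x + 5 for x in xs], ys)
print(r, r_rescaled)
```

For this toy data r is 0.8, and the rescaled input gives the same value up to floating-point error.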
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
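A minimal sketch of that procedure: rank both variables, then apply the Pearson formula to the ranks. Ties are given the average of their positions, which is a common convention but an assumption here; the names `average_ranks` and `spearman_rho` are illustrative.

```python
import math

def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(xs, ys):
    """Pearson correlation computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# Hypothetical toy data: a monotonic but nonlinear relationship (y = x**3).
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
print(spearman_rho(xs, ys), pearson_r(xs, ys))
```

On this cubic toy example ρ reaches 1 while Pearson's r stays below 1, which illustrates why ρ is the better fit for nonlinear monotonic relationships.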
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
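The pair-counting definition given above corresponds to the variant usually called tau-a. The sketch below implements exactly that definition. Note that libraries such as SciPy default to the tie-adjusted tau-b variant, so results can differ when the data contain ties. The function name `kendall_tau_a` and the toy data are illustrative.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        sign = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
        # sign == 0 is a tie and counts as neither.
    return (concordant - discordant) / len(pairs)

# Hypothetical toy data: 10 pairs, of which 8 are concordant and 2 discordant.
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]
tau = kendall_tau_a(xs, ys)
print(tau)
```

With 8 concordant and 2 discordant pairs out of 10, τ is (8 − 2) / 10 = 0.6.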
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.
First rows
| | customer_id | order_date | order_hour | customer_order_rank | is_failed | voucher_amount | delivery_fee | amount_paid | restaurant_id | city_id | payment_id | platform_id | transmission_id |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 000097eabfd9 | 2015-06-20 | 19 | 1.0 | 0 | 0.0 | 0.000 | 11.46960 | 5803498 | 20326 | 1779 | 30231 | 4356 |
| 1 | 0000e2c6d9be | 2016-01-29 | 20 | 1.0 | 0 | 0.0 | 0.000 | 9.55800 | 239303498 | 76547 | 1619 | 30359 | 4356 |
| 2 | 000133bb597f | 2017-02-26 | 19 | 1.0 | 0 | 0.0 | 0.493 | 5.93658 | 206463498 | 33833 | 1619 | 30359 | 4324 |
| 3 | 00018269939b | 2017-02-05 | 17 | 1.0 | 0 | 0.0 | 0.493 | 9.82350 | 36613498 | 99315 | 1619 | 30359 | 4356 |
| 4 | 0001a00468a6 | 2015-08-04 | 19 | 1.0 | 0 | 0.0 | 0.493 | 5.15070 | 225853498 | 16456 | 1619 | 29463 | 4356 |
| 5 | 0001d9036b5e | 2015-08-29 | 19 | 1.0 | 0 | 0.0 | 0.000 | 11.94750 | 193643498 | 88276 | 1619 | 29463 | 4356 |
| 6 | 0001d9036b5e | 2017-01-04 | 17 | 2.0 | 0 | 0.0 | 0.000 | 11.15100 | 193643498 | 88276 | 1619 | 29463 | 4356 |
| 7 | 0001d9036b5e | 2017-01-28 | 16 | 3.0 | 0 | 0.0 | 0.000 | 9.71730 | 193643498 | 88276 | 1619 | 30359 | 4356 |
| 8 | 0001e1e04d7d | 2015-10-24 | 19 | 1.0 | 0 | 0.0 | 0.000 | 25.22250 | 144833498 | 45358 | 1619 | 29463 | 4356 |
| 9 | 0001e1e04d7d | 2016-03-24 | 19 | 2.0 | 0 | 0.0 | 0.000 | 9.29250 | 95953498 | 45358 | 1619 | 29463 | 4324 |
Last rows
| | customer_id | order_date | order_hour | customer_order_rank | is_failed | voucher_amount | delivery_fee | amount_paid | restaurant_id | city_id | payment_id | platform_id | transmission_id |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 786590 | fffcf45e5c69 | 2016-11-19 | 12 | 1.0 | 0 | 0.0 | 0.0000 | 12.53160 | 107463498 | 39335 | 1619 | 29463 | 4356 |
| 786591 | fffcf45e5c69 | 2017-02-04 | 12 | 2.0 | 0 | 0.0 | 0.0000 | 11.57580 | 107463498 | 39335 | 1619 | 30359 | 4356 |
| 786592 | fffd696eaedd | 2015-09-14 | 12 | 1.0 | 0 | 0.0 | 1.4297 | 24.13395 | 95323498 | 80562 | 1779 | 29463 | 4356 |
| 786593 | fffe9d5a8d41 | 2016-07-31 | 21 | NaN | 1 | 0.0 | 0.0000 | 8.44290 | 156133498 | 10346 | 1811 | 29463 | 212 |
| 786594 | fffe9d5a8d41 | 2016-09-30 | 20 | 1.0 | 0 | 0.0 | 0.0000 | 10.72620 | 983498 | 10346 | 1779 | 29463 | 4228 |
| 786595 | fffe9d5a8d41 | 2016-09-30 | 20 | NaN | 1 | 0.0 | 0.0000 | 10.72620 | 983498 | 10346 | 1779 | 29463 | 212 |
| 786596 | ffff347c3cfa | 2016-08-17 | 21 | 1.0 | 0 | 0.0 | 0.0000 | 7.59330 | 52893498 | 41978 | 1619 | 30359 | 4356 |
| 786597 | ffff347c3cfa | 2016-09-15 | 21 | 2.0 | 0 | 0.0 | 0.0000 | 5.94720 | 164653498 | 41978 | 1619 | 30359 | 4356 |
| 786598 | ffff4519b52d | 2016-04-02 | 19 | 1.0 | 0 | 0.0 | 0.0000 | 21.77100 | 16363498 | 80562 | 1491 | 29751 | 4228 |
| 786599 | ffffccbfc8a4 | 2015-05-30 | 20 | 1.0 | 0 | 0.0 | 0.0000 | 16.46100 | 150293498 | 45952 | 1619 | 29463 | 4324 |